Overview of the 1th International Competition on Quality Flaw Prediction in Wikipedia

نویسندگان

  • Maik Anderka
  • Benno Stein
چکیده

The paper overviews the task “Quality Flaw Prediction in Wikipedia” of the PAN’12 competition. An evaluation corpus is introduced which comprises 1 592 226 English Wikipedia articles, of which 208 228 have been tagged to contain one of ten important quality flaws. Moreover, the performance of three quality flaw classifiers is evaluated. Pamela Forner, Jussi Karlgren, and Christa Womser-Hacker (Eds.): CLEF 2012 Evaluation Labs and Workshop – Working Notes Papers, 17-20 September, Rome, Italy. ISBN 978-88-904810-3-1. ISSN 2038-4963. 2012.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Overview of the 1st International Competition on Quality Flaw Prediction in Wikipedia

The paper overviews the task “Quality Flaw Prediction in Wikipedia” of the PAN’12 competition. An evaluation corpus is introduced which comprises 1 592 226 English Wikipedia articles, of which 208 228 have been tagged to contain one of ten important quality flaws. Moreover, the performance of three quality flaw classifiers is evaluated.

متن کامل

On the Use of PU Learning for Quality Flaw Prediction in Wikipedia

In this article we describe a new approach to assess Quality Flaw Prediction in Wikipedia. The partially supervised method studied, called PU Learning, has been successfully applied in classifications tasks with traditional corpora like Reuters-21578 or 20-Newsgroups. To the best of our knowledge, this is the first time that it is applied in this domain. Throughout this paper, we describe how t...

متن کامل

Fatigue Life Prediction of Rivet Joints

Strength reduction in structures like an aircraft could be resulted as cyclic loads over a period of time and is an important factor for structural life prediction. Service loads are emphasized at the regions of stress concentration, mostly at the connection of components. The initial flaw prompting the service life was expected by using the Equivalent Initial Flaw Size (EIFS) which has been re...

متن کامل

FlawFinder: A Modular System for Predicting Quality Flaws in Wikipedia

With over 23 million articles in 285 languages, Wikipedia is the largest free knowledge base on the web. Due to its open nature, everybody is allowed to access and edit the contents of this huge encyclopedia. As a downside of this open access policy, quality assessment of the content becomes a critical issue and is hardly manageable without computational assistance. In this paper, we present Fl...

متن کامل

Overview of the 2nd International Competition on Wikipedia Vandalism Detection

The paper overviews the vandalism detection task of the PAN’11 competition. A new corpus is introduced which comprises about 30 000 Wikipedia edits in the languages English, German and Spanish as well as the necessary crowdsourced annotations. Moreover, the performance of three vandalism detectors is evaluated and compared to those of the PAN’10 competition. Vivien Petras and Paul Clough (Eds.)...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012